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Abstract 

The incomplete statistics for complex systems is characterized by 
a so called incompleteness parameter tu which equals unity when in- 
formation is completely accessible to our treatment. This paper is 
devoted to the discussion of the incompleteness of accessible informa- 
tion and of the physical signification of u on the basis of fractal phase 
space, u is shown to be proportional to the fractal dimension of the 
phase space and can be linked to the phase volume expansion and 
information growth during the scale refining process. 

PACS number : 02.50.Cw,02.70.Rr,89.70.+c,89.75.Da 



Introduction 

A fundamental hypothesis in Boltzmann statistical physics, explicit or 
not, is that the probability distribution is complete so that Y^LiPi = 1, 
where w is the number of all the possible states labelled by i and pi is the 
probability that the system is at the state i. This is the complete probability 
normalization^. In other words, one can treat a complete or asymptotically 
complete set of states of the system under consideration. This hypothesis of 
complete statistics implies that the following two conditions must be satisfied. 
1) All possible states of the system of interest are well known, which requires 
a complete knowledge of the dynamics of the system. 2) The known states are 
accountable and calculable, or in other words, the information is accessible 
to us. Are these two conditions always satisfied? 

As the study of complexity advanced, it is revealed that chaotic and frac- 
tal behaviors are ubiquitous in nature EJ IH El E], and that a fractal phase 
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space in general cannot be completely and exactly exploited. Any calculation 
based only on the accessible states or the differentiable (integrable) points 
in this phase space is necessarily incomplete due to the rejected or unknown 
(singular or unaccessible) points. So the complete descriptions may be satis- 
factory if and only if these unaccessible points are negligible. If it is not the 
case, these points should be taken into account or the theoretical descriptions 
would be aberrant. The question is : how to include the states of a physical 
system in a theory when they are not accessible to us? 

Recently, we have developed a method in this direction |"J H2 EH EEU 
112*1 EI] : a statistical mechanics based upon the notion of incomplete in- 
formation which has been shown capable of giving coherently the Tsal- 
lis g-exponential distributions within the nonextensive statistical mechanics 
(NSM)jH] and, moreover, useful quantum distributions for some systems of 
correlated electrons"""","""!]. In the present paper, I am presenting some argu- 
ments to support the hypothesis of information incompleteness. The physical 
significations of the incompleteness parameter will be discussed on the basis 
of fractal phase space. 

Incomplete information hypothesis 

In the conventional probabilistic science, it is supposed that our igno- 
rance, or information, is completely accessible and may be calculated by, 
e.g., the Hartley formula ij = ln(l/pj) and Shannon information measure I = 

Pi hr(l/pj)[IIJ""""""|, as wei l as by other generalized information measures ["Q 
CnillZllIHllini, under the harsh condition that all possible states are accessible 
to us so that all probabilities sum to one. 

However, the hypothesis of incomplete information admits simply that 
a part of our ignorance about complex system may not be accessible to 
our treatment. It cannot be calculated from probability distributions. In 
other words, the information calculated by the above mentioned method 
on the basis of the complete probability normalization may be incomplete 
because here we only have J21=iPi — ^DP where v is the number of the 
accessible or accountable states and may be greater or smaller than w, the 
total number of states. Q represents the incompleteness of the treatment 
and necessarily linked to the nature of the system. Logically, the origine of 
this incompleteness may be attributed either to the partial knowledge of the 
dynamics or to the unaccessible (incalculable) states of the system. 

Incomplete information hypothesis has been motivated originally by some 
fundamental problems encountered in NSM[""J |J3in||I2|II3IIISIIIZlCEIIISl- 
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It replaces the complete probability normalization by[S] 

X>r = i, a) 

i=i 

where uo is referred to as incompleteness parameter which equals unity if the 
probability distribution is complete pQ. Then on the basis of the Hartley ad- 
ditive information measure Jj = ln(l/pj) and the nonadditive generalization 
of Hartley formula = ln w (l /pj) = ~~\ two incomplete entropies 

S^i = -kEiPt^Pi and = fc 1 "^ (w G i2) Ej are deduced. These 
formalisms not only give in a coherent way the nonextensive incomplete dis- 
tributions for canonical ensemble Pi oc [1 — (1 — co)x] u ' 'A 1-£J ) which has been 
proved very useful for many systems having non-gaussian distributions, they 
also lead to quantum distributions of which the first results of the applications 
to correlated electron systems seem quite interesting and promising[9| II 1| ITH] . 

With the incomplete normalizationEq.^ , the parameter to can be uniquely 
related to the incompleteness Q by 

|>r + (^-x>r = 1, (2) 

i=l i=l 

for any < pi < 1, v and Q. Figure 1 shows the relation between uo and Q for 
a given incomplete distribution {pj=i...5} = {0.1, 0.15, 0.2, 0.25, fl — 0.7}. We 
see that, in general, Q > 1 and Q < 1 lead to u > 1 and a; < 1, respectively. 
If Q is such that a certain p& — > 1 among {p i=1 5 }, then c<j — > oo. If O — > 0, 
-»• 0. 



Incomplete normalization in its general form 

In general, we should write 

E*Uft) = i ( 3 ) 

i=l 

in order to normalize incomplete distributions. The function should de- 
pend on the nature of the system and become identity function whenever 
information is supposed complete (Q = 1). The arithmetic average of a 
quantity x should now be given by 

V 

x = Y / F U! (p i )x i . (4) 

2=1 
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Figure 1: uj — Vt dependence for a given distribution {pi=i...s} = 
{0.1, 0.15, 0.2, 0.25, tt - 0.7}. Note that u -»• oo when Q -> 1.7 in this case. 



where x« is the value of x at the state i. 

may be determined if the information measure and the distribution law 
are given. For example, with Hartley information measure and exponential 
distribution, F u can be showed to be identity function PJ. In general, by 
entropy maximization through the functional 

8\EF u (p i )I(p i )+a^F u (p i )+P'EF u (p i )x i ]=0 (5) 

i i i 

we get : 

dlnF^jpi) _ dl/dpi 

d Pl ~I + a + f^( Pl ) U 
or F w (pi) = Cexp[J Trffir r t/pj where a and /3 are the multipliers of La- 

l+PJu [Pi) 

grange in Eq.(J5J), I{pi) is the information measure, = /^(xj) the distri- 
bution function depending on the parameter cj, and C the normalization 
constant of F u . 

Within this frame, we would be able to group many entropies [20] into 
the family of incomplete entropies if we suppose that the Hartley formula 
Ii = ln(l/pi) holds. For example, the additive entropy proposed by Gor- 
ban and coworkers |21j to generalize Gibbs-Shannon entropy and to obtain 
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the long and short tail distributions may be considered as the expectation of 
Hartley information with F a (pi) = Pi — a(pi + l) for an incomplete description 
discussed by the authors. In this case, the incompleteness of the description 
may be given by Ya=\Pi — T^f- When a = 0, the description becomes com- 
plete and Gorban entropy recovers Gibbs-Shannon one. The second example 
is the entropy due to Burg for solving geophysical problems [22] : S oc J2i hipj. 
To have this, we may put F^ipi) — 1/v which is a pseudo-equiprobability. 
Other examples are the entropies of Belis-Guiasu|22] with F^Pi) = WiPi for 
the study of the information in cybernetic systems and the entropies of Aczel- 
Daroczyj21] with F^Pi) = p\- So the incomplete normalization seemingly 
provides a plausible approach to link certain special entropies to elementary 
information. 

About incomplete information in fractal phase space 

Through a detailed analysis of the entropy and information on fractal 
supports, Naschie[2I)] shows a connection between the information and the 
topological dimension of some fractal sets. It is also shown that the dynamics 
of this kind of phase space is quasi-ergodic for n > 4 where n is the dimension 
of the set or the number of particles in the fractal phase space of topological 
dimension <i/ [23J . The results of the analysis tell us that the Boltzmann 
entropy and Gibbs-Shannon entropy for Cantor set increase with increasing 
capacity dimension given by d^ 1 ' = (l/<ij) n ~ 1 which in turn increases with 
increasing dimension n of the fractal set. On the other hand, d^f> should 
increase with decreasing topological dimension df. This behavior is similar 
to that of S u i and with respect to the parameter cujHJUU], which means 
that uj may have something to do with the topological dimension in fractal 
phase space. 

In what follows, the information on fractal support will be studied from 
a different angle with the help of the incomplete normalization Eq.Q. First 
of all, we will show that this kind of power normalization is inevitable for 
the calculation of fractal information. 

An incomplete normalization 

For the sake of simplicity, let us consider a phase space in which the 
trajectory of a complex system forms a self-similar fractal structure, say, 
Sierpinski carpet (Figure 1). This means that the state point of the system 
can be found only on the black rectangular segments whose number is Wk = 
8 h at k th iteration. Hence the total surface at this stage is given by Sk = W^Sk 
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Figure 2: A simple model of fractal phase space in Sierpinski carpet which 
is a self-similar fractal structure generated by dividing, at the first iteration, 
a full square of side Z into 3 2 smaller squares of equal sides of Z /3 and 
removing the central small square. This procedure will be repeated on every 
small square at the next iteration and so on. At k th iteration, the side of the 
squares (black or white) is Ik = Zo/3 fc and their number is Wk = 8 k , lo being 
the length of the side at th iteration. The total surface at k th iteration is 
Sk = WkSk or WkSk/Sk = 1. The classical probability definition by relative 
frequency of visits of each point by the system should be modified because the 
total number of visits (proportional to black surface Sk of the carpet) is no 
more a finite quantity. (Construction of Sierpinski carpet. First iteration c(l) 
: removing the central square formed by the straight lines cutting each side 
into three segments of equal size. Repeat this operation on the 8 remaining 
squares of equal size and so on.) 
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where Sk = lo/3 k is the surface of the segments at k th iteration and lo the 
length of side of the square space at th iteration. If the segments do not 
have same surface, we should write Sk = Ya=i Ski})- 

Now let us suppose that the density of state is identical everywhere on 
the segments and that the dynamics in this phase space is quasi-ergodic : 
every point on the segments is equally visited, so that the probability for the 
system to be in the i th segment may be defined as usual by Pi = Sk(i)/Sk- 
This probability is obviously normalized. The problem is that, as discussed 
in pj, Sk is an indefinite quantity as k — > oo or as the time t — > oo. So strictly 
speaking, it can not be used to define exact probability definition or all the 
probabilities would be null or lose its additivity and normalization according 
to the countable additivity axiom of probability or total probability axiom 
which asserts that probabilities can be additive on a set of finite cardinals 
(number of elements). In addition, Sk is not differentiable and contains 
inaccessible points. Thus the probability defined above makes no sense. 

Alternatively, the probability may be reasonably defined on a integrable 
and differentiable support, say, the Euclidean space containing the fractal 
structure. To see how to do this, we write Sk = ^o(-^k) d ~ df for identical 
segments or, for segments of variable size, 

U^) df/d = 1 (7) 

where So = Iq (here d = 2 for Sierpinski carpet) is a characteristic volume of 
the fractal structure embedded in a d-dimension Euclidean space, df = is 
the fractal dimension, n = 8 is the number of segments replacing a segment 
of the precedent iteration and m = 3 the scale factor of the iterations. 

The micro canonical probability distribution at the k th iteration can be 
defined as 

Sk(i) 

Pi = 

so that 

Edf/d 1 

i=l 

This is just Eq.flJ) with u = df/d. The complete probability normalization 
J2i=iPi — 1 can be recovered when df — d. 

It should be noticed that, in Eq.((7j), the sum over all the Wk segments 
at the k th iteration does not mean the sum over all possible states of the 
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system under consideration. This is because that the segment surface Sk{i) 
does not represent the real number of state points on the segment which, as 
expected for any self-similar structure, evolves with k just as Sk- So at any 
given order k, the complete summation over all possible segments is not a 
complete summation over all possible states. But in any case, whatever is k, 
Eq.(J7J) and J2i=iPf — 1 always hold for u = df/d. 

In this simple case with self-similar fractal structure, the incompleteness 
of the normalization Eq.(JJ) is measured by the parameter uj = df/d. df > d 
means that the system has more states than Wk, the number of accessible 
states at given k. If df < d, the number of states is less than Wk- When 
df = d, the summation is complete at any order k, corresponding to complete 
information calculation. 

uj and phase space expansion 

Now we will discuss in a detailed way the incompleteness parameter uj 
and its physical meanings. Let us discuss this still in the case of self-similar 
fractal phase space. 

As discussed in the case of chaotic phase space, uj = In n/d In m gives a 
measure of the incompleteness of the state counting in the <i-dimension phase 
space, lo — 1 means df = d or n = m d . In other word, at the k th iteration, 
a segment of volume Sk is completely covered (replaced) by n segments of 
volume Sk+i = Sk/m d . So the summation over all segments is equivalent 
to the sum over all possible states, making it possible to calculate complete 
information. 

When tjj > 1 (or u < 1), n > m d (or n < m d ) and Sk is replaced by n 
segments whose total volume is more (or less) than Sk- So there is expansion 
(or negative expansion) of state volume when we refine the phase space scale. 
An estimation of this expansion at each scale refinement can be given by the 
ratio r = WSfc+1 ~ Sfc = - 1 = (^r) 1 ^ - 1 = (u - l) ^"" 1 " 1 . r describes 
how much accessible states increase at each step of the iteration or of the 
refinement of phase space. The physical content of uj is clear if we note 
that uj > 1 and uj < 1 correspond to an expansion (r > 0) and a negative 
expansion (r < 0), respectively, of the the accessible state volume at each 
step of the iteration. 

When uj = 0, we have df = and n = 1, leading to r = -\ — 1. The 
iterate condition n > 1 means uj > 0, as proposed in references [H]. OJ < is 
impossible since it means (i/ < or n< 1 which obviously makes no sense. 
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We can also write : u — 1 = ln(r + l)/ln(m d ) = ln(ns/ c+ i/s/ c )/ln(m d ), which 
implies that it is the difference uj — 1 which directly measures the accessible 
state space expansion through the scale refinement. 

uj and information growth 

The expansion of the accessible state volume of a system in its phase 
space during the scale refinement should be interpreted as follows : the extra 
state points A = ns k+ i — s k acquired at (k + l) th order iterate are just the 
number of unaccessible states at k th order with respect to (k + l) th order. 
A > (or A < 0) means that we have counted less (or more) states at k th 
order than we should have done. A contains the accessible information gain 
(AIG) through the (k + l) th iterate. 

To illustrate the relation between this "hidden information" and the pa- 
rameter uj, let us suppose that the distribution is scale-invariant[27\. At 
the iterate of order k, the average information contained on Sk is given by 
h = ! Sk p UJ I{l/p)ds. At k + 1 order, I k+1 = f nSk+i p Ui I(l/p)ds. Hence AIG 
is just AI = I k+1 -I k = f {nSk+i _ Sk) p»I(l/p)ds = a! A, where a 1 = p"I{l/p) 
is the information density or the average information carried by each state. 
The relative AIG is given by AI/I k = r = (1 — uj) ^ImJ— - 1 w hich is in- 
dependent of scale but dependent on scale changes. For given scaling factor 
m, the magnitude of AI or r increases with increasing difference |1 — u\. 
The sign of r (or AIG) was discussed earlier. For given uj, \AI\ increases 
with decreasing scaling. For uj — 1 or m — 1, there is no information gain, 
corresponding to the case of complete information. According to the above 
discussions, the incompleteness parameter uj may be considered as a measure 
of the degree of chaos or fractal. 

Conclusion 

Summing up, we have argued that the information of a complex system 
we deal with may be incomplete because we only partially know the dynamics 
of the system or a part of the states of the systems is not accessible to our 
calculation. The parameterized normalization J2iPt = ^ is justified on fractal 
support, where the incompleteness parameter u is shown to be proportional 
to the fractal dimension of the phase space and can be directly linked to the 
phase volume expansion and information growth during the scale refining 
process. 
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